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ABSTRACT 

We present predictions of the correlation between the Lya forest absorption in quasar 
spectra and the mass within ~ 5/i -1 Mpc (comoving) of the line of sight, using fully 
hydrodynamic and hydro-PM numerical simulations of the cold dark matter model 
supported by present observations. The observed correlation based on galaxies and the 
Lya forest can be directly compared to our theoretical results, assuming that galaxies 
are linearly biased on large scales. Specifically, we predict the average value of the mass 
fluctuation, < 5 m >, conditioned to a fixed value of the Lya forest transmitted flux 
Sf, after they have been smoothed over a 10/i _1 Mpc cube and line of sight interval, 
respectively. We find that < 5 m > /a m as a function of Sf/ctf has a slope of 0.6 
at this smoothing scale, where a m and of are the rms dispersions (this slope should 
decrease with the smoothing scale). We show that this value is largely insensitive to the 
cosmological model and other Lya forest parameters. Comparison of our predictions to 
observations should provide a fundamental test of our ideas on the nature of the Lya 
forest and the distribution of galaxies, and can yield a measurement of the bias factor 
of any type of galaxies that are observed in the vicinity of Lya forest lines of sight. 

Subject headings: cosmology: theory—intergalactic medium—large-scale structure of 
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1. INTRODUCTION 

The prevalent theory to explain the Lya forest is that the absorption lines arise from density 
variations in a photoionized intergalactic medium that originate in the gravitational evolution of 
primordial fluctuations. Both semi-analytic models and numerical simulations (Bi 1993; Cen et al. 
1994; Zhang et al. 1995, 1998; Hernquist et al. 1996; Miralda-Escude et al. 1996; Bi &; Davidsen 
1997) have shown that the predicted Lya spectra appear remarkably similar to the observations. 
The good agreement of the predicted and observed flux distribution and power spectrum of the Lya 
forest (Rauch et al. 1997; Croft et al. 1999; McDonald et al. 2000), and the large transverse size 
of the absorption systems (Bechtold et al. 1994; Dinshaw et al. 1994, 1997; Petitjean et al. 1998; 
Monier, Turnshek, & Hazard 1999; Dolan et al. 2000; Lopez, Hagen, & Reimers 2000) are the basic 
tests that have so far been done and have supported the theory. In addition, this Lya forest theory 
is derived from the general Cold Dark Matter (hereafter, CDM) model with parameters that are 
well constrained from several other observations (e.g., Primack 2000). 

Another important test of our ideas of the Lya forest can be done by observing the correlation 
of the transmitted flux in a spectrum with galaxies. Some of these observations have already 
been done for small scales and high column density absorbers, which have shown the expected 
strong correlation between galaxies and gas halos (e.g., Bergeron & Boisse 1991; Steidel, Dickinson, 
& Persson 1994; Lanzetta et al. 1995; Chen et al. 2001). Penton, Stocke, & Shull (2001) have 
probed this correlation at low redshift, and have found that the weak absorption lines are often 
found in low-density regions of the galaxy distribution. Recently, Adelberger et al. (2001) have 
carried out the first analysis of this correlation on large scales (several comoving Mpc) and at high 
redshift (using galaxies detected with the Lyman break technique), using the transmitted flux as 
the quantity to correlate with the mean number of galaxies in a specified region. 

Inspired by the observational results of Adelberger et al. (2001), this paper presents detailed 
theoretical predictions for statistical functions similar to the one introduced by those authors. 
Namely, we analyze the mean value of the mass given an observed value of the transmitted flux, 
and the mean value of the transmitted flux for a fixed value of the mass, after both quantities have 
been smoothed over a certain region. The main difference between the functions we analyze and 
the function shown by Adelberger et al. (2001) is that the observed objects are of course galaxies, 
and our simulations predict only the distribution of the mass. However, the predictions of the 
simulations can still be compared to the galaxy observations to determine the relation between 
galaxies and mass, which is especially straightforward if linear bias is a sufficient description of 
the distribution of galaxies relative to the mass on the large scales being probed. We examine the 
dependence of these statistical functions on the various parameters affecting the Lya forest in §3. 
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2. METHOD 

We use several HPM (hydro-particle-mesh; see Gnedin & Hui 1998) simulations to model the 
Lya forest, after testing that the HPM approximation is sufficient for our purposes by comparing 
to a fully hydrodynamic simulation. The standard HPM simulations that will be used most often 
in this paper have box size 40 /i -1 Mpc, with 512 3 particles. The cosmological model is CDM with 
cosmological constant, in a flat universe with present matter density = 0.4, power spectrum 
index n = 0.95, and amplitude given by erg = 0.75. All the results in this paper will be shown 
at z = 3, the typical redshift at which the observations have so far been done. The model we 
use is consistent with the observed Lya forest power spectrum: at the typical Lya forest scale of 
k = 0.008 (kms -1 ) -1 , it has A 2 p (k) = 0.29 at z = 3 [where A 2 p (k) is the contribution per unit In k 
to the variance of the linear theory mass density fluctuations], while the observed value is ~ 0.26 
(McDonald et al. 2000; Croft et al. 2002). The redshift interval corresponding to 40/i -1 Mpc in 
this model near z = 3 is Az = 0.068. The density-temperature relation assumed for the gas is 
T(A) = Ti. 4 (A/ 1 . 4 ) 7_1 , where A is the gas density divided by the mean gas density. We use the 
parameters T 1.4 = 17000 K and 7 — 1 = 0.3 in our standard simulation (the reason we specify the 
value of the temperature T 1.4 at A = 1.4 is because the error in the temperature was determined 
to be smallest at this density in McDonald et al. 2002). Several other similar simulations are used 
where we vary each one of the relevant parameters (see §3), and we vary also the box size and 
resolution to verify the numerical convergence of the results. Unless otherwise indicated, all the 
results we show for a given set of parameters are averages of four simulations with different random 
initial perturbations. 


2.1. Definitions 

We define the quantity = 1 — F/F , where F is the fraction of transmitted flux in the 
Lya forest after the spectrum has been smoothed along the line of sight with a filter, and F is 
the mean transmitted flux. We use a Gaussian smoothing filter in this paper, which is W ( kR ) = 
exp[— (kR) 2 /2] in Fourier space. We also define S rn as the redshift-space mass density perturbation, 
again smoothed over a specified region. We choose to smooth the mass fluctuation over cubes 
of 10 /? -1 Mpc (comoving) in redshift space, centered on the Lya forest line of sight, which is 
approximately the smoothing that is used in Adelberger et al. (2001). This is convenient to do 
in observations of galaxies, since galaxies are usually searched for in a square field of view, and 
they can simply be divided into redshift bins. Note that the size of our cubes in velocity units, 
1280 km s -1 , is more fundamental than the size in h~ l Mpc, because the temperature and observed 
flux power spectrum are both measured in kms -1 . 

In general, the information that can be recovered from observations is the full joint probability 
distribution P(SF,5 g ), where 5 g is the fluctuation in the number of galaxies. In this paper we 
present results from HPM numerical simulations for < >, which is the mean value of the 
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mass fluctuation subject to the condition of a fixed value in the transmitted flux fluctuation, and 
for the analogous quantity < 8 p \ 8 rn >. The observed galaxies may not follow exactly the same 
fluctuations as the mass, even when smoothed over a scale of 10 h Mpc. However, if galaxies are 
linearly biased at this large scale, then 8 g = b8 m , where 8 g is the fluctuation in the galaxies and b 
is the bias factor. If we now define the quantity 8 m = 8 m /a m , and 8 g = 8 g /a g , where u m and a g are 
the mis fluctuation of 5 m and 5 gi then S m = 8 g and the correlation with 8p is precisely predicted. 
For convenience, we also define the quantity 8f = 8f/<?f, and we will present results for <(5 m |<5p> 
and <5f\ b m >- 


2.2. Smoothing 

To choose a value of the smoothing radius R for the smoothing filter of the Lya forest transmit¬ 
ted flux, we compute first the correlation coefficient < <5 f< 5 m > /crF&m in our standard simulation, 
and show it as a function of R in Figure 1. The correlation is maximum at R = 3.5 h~ l Mpc. The 
dependence of the correlation on R is not surprising: if R is too small, the value of 5p is altered 
by small-scale fluctuations that are not related to the value of S m and therefore act as noise, and 
if R is too big, then the fluctuations affecting 5 m are erased by smoothing in the value of 8f- 

A Gaussian filter with R = 3.5 /i -1 Mpc will be used for the smoothing of the Lya spectrum 
throughout the rest of this paper. However, before proceeding, we examine how much our results 
differ if we use instead a top-hat smoothing for the Lya spectrum, or if we vary the size of the cube 
over which the mass is smoothed. 

Throughout the paper, our results will be presented as a set of four figures showing the four 
functions < 8 m \8F >, < 8 f |<5 m >, < 8 m \8F >, and < 8F\8 m >- hi all these figures, the function 
< 8 m | 8f > has been obtained by creating spectra along each row of cells in the simulation, smoothing 
the spectra along the line of sight, selecting all the pixels in the spectra where the value of 8f is 
inside a given bin, and computing the average 8 m for these pixels, where 8 rn is computed in redshift- 
space. We use bins of width A 5f = 0.05. Similarly, < 8p\8 m > is calculated by averaging the values 
of 5f for pixels in bins with a given 8 m (again using A 8 m = 0.05 for the bin width). 

In Figure 2(a-d), we compare the results for our standard Gaussian filter (black, solid line) to 
the results obtained if we simply average the Lya forest flux across the 10 h -1 Mpc extent of the 
cube with which the mass is smoothed, i.e., use a top-hat filter with R = 5 h _1 Mpc (red, dotted 
line). The difference is not large, although noticeable, for the S m -Sp relations, and is practically 
zero for the 8 m -5F relations. 

We also show in Figure 2 the effect of changing the size of the cube used to smooth the mass 
density field from 10/? _1 Mpc to 12 h -1 Mpc (green, long-dashed line) or 8/i -1 Mpc (blue, short- 
dashed line). In both of these cases we also change the smoothing length for the spectrum by the 
same factor. As the size of the smoothing cube is reduced, the mass fluctuations increase and they 
are more strongly correlated with the Lya forest, as expected. We see that for < 8 rn \8p > it is 
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Fig. 1.— Correlation between mass and transmitted flux as a function of the smoothing applied on 
the spectrum, with the Gaussian filter, W{kR ) = exp[— (kR)/2\. The squares show the correlation, 
< 5f > /(?F 0"m)* 
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Fig. 2.— Comparison between our standard 3.5 h _1 Mpc Gaussian smoothing of the spectrum with 
10 h Mpc cube smoothing of the mass (solid, black line ) and 5 h^ 1 Mpc top-hat smoothing of the 
spectrum (dotted, red line), 12 h^ 1 Mpc cube smoothing of the mass with 4.2/i -1 Mpc Gaussian 
smoothing of the spectrum (long-dashed, green line), or 8/i -1 Mpc cube smoothing of the mass 
with 2.7 h~ l Mpc Gaussian smoothing of the spectrum (short-dashed, blue line), (a) mean mass 
as a function of transmitted flux, (b) mean flux as a function of mass, (c and d) are as (a and 
b) except that dp and 5 m are divided by their rms fluctuation. The mass is always computed in 
redshift space. 
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important to match the dimensions of the cube in the simulations and observations to compare 
the two. Note, however, that the transverse angular size of the cube depends on the cosmological 
geometry. The other plots show less sensitivity to the cube size. 

To promote comparison between our results and observations or other simulations, in Table 1 
we give < <5 m |<5/r > and <<5p|<5 m > for the three cube sizes, along with a f and a m needed to convert 
from 5 to 5. The rows in the Table labeled 2s, 21d, and 2sd correspond to cube size 10 h~ l Mpc, 
12 h _1 Mpc, and 8 h _1 Mpc, respectively. 


2.3. Test of the HPM Approximation 

We resort to approximate HPM simulations (Gnedin & Hui 1998) because it is impractical 
at the present time to perform fully hydrodynamic simulations of the required size for all of the 
parameter variations we would like to explore. However, we first test the accuracy of the HPM 
approximation by comparing to a state of the art hydrodynamic simulation of a similar cosmological 
model. This simulation is Eulerian, with box size 25 /r _1 Mpc divided into 768 3 cells for baryons, 
with 384 s dark matter particles (see Cen et al. 2001 for a more complete description). We compare to 
a 25 /i” 1 Mpc, 512 3 particle HPM simulation with identical initial Fourier modes up to the Nyquist 
frequency of the mesh (as we show in the next subsection, the resolution of these simulations is 
sufficient for convergence of our chosen statistics, so we do not need to worry about exactly how to 
equate resolution between the two). 

The comparison for our statistics is shown in Figure 3, where the solid lines are the HPM 
results and the dotted lines are the fully hydrodynamic results. The agreement is excellent 
except in the highest Sf regions in the < 5 m \5p > and < 5 m \5 f > comparisons (a and c). For 
reference, in Figure 3(a and b) we plot the PDFs of 5f and S m , respectively (defined as the relative 
volume-weighted probability of finding <5 within each of our usual bins). We see that the regions 
where the HPM approximation breaks down are extremely rare (at low S m in Figure 2b the lines 
terminate at the lowest density found in the simulations). 


2.4. Convergence of the Results with Resolution and Box Size 

We start by testing the sensitivity of our numerical results to the resolution. For this purpose, 
we use boxes with size 20h~ 1 Mpc in addition to our standard value of 40/i _1 Mpc. 

The results for the four functions, < >, < i>V|hm >, < 4 m |i>V >, and < 5p| 6 m >, are 

shown in Figures 4(a,b,c,d) for our standard cosmological model, with box size 20/i _1 Mpc, and 
resolution of 512 3 and 256 3 particles for the long-dashed and short-dashed lines, respectively. The 
solid and dotted lines show results with box size 40 h” 1 Mpc, again with 512 3 and 256 3 particles, 
respectively. The agreement between the 20 h~ l Mpc simulations differing only in the resolution 
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Fig. 3.— Test of the HPM approximation. Black (solid) lines: HPM results; red (dotted) lines: 
fully hydrodynamic results; green (dashed) lines: PDF of Sp (a) and 5 m (b). The normalization of 
the PDFs is arbitrary. 
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Fig. 4.— Resolution tests. Green (long-dashed) line: 20 h~ l Mpc simulation box with 512 3 particles; 
blue (short-dashed) line: 20/i -1 Mpc, 256 3 particles. Black (solid) line: 40/i _1 Mpc, 512 3 particles; 
red (dotted) line: 40 h^ 1 Mpc, 256 3 particles, (a) mean mass as a function of transmitted flux, (b) 
mean flux as a function of mass, (c and d) are as (a and b) except that 6f and S m are divided by 
their rms fluctuation. The mass is always computed in redshift space. 
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is excellent, verifying that 40/r -1 Mpc simulations with 512 3 particles should be well resolved. The 
agreement between 40 /r _1 Mpc simulations with different resolution is still quite good except at 
high 6f, indicating that 80 h -1 Mpc, 512 3 particle simulations could still be useful. 

Figure 4 shows a disturbingly large difference between the results for 20 h~ l Mpc and 40 h Mpc 
boxes; however, assessing the effect of the box size is more difficult than for the resolution, because 
we cannot use the same set of initial mode amplitudes at each wavenumber, so the statistical fluc¬ 
tuations in the simulations introduce significant random differences in the result. Figures 5(a-d) 
compare results for 40 h^ 1 Mpc boxes (black, solid lines), 20h~ 1 Mpc boxes (red, dotted lines), and 
80 h _1 Mpc boxes (green, dashed lines), all with 512 3 particles. The thick lines show the average of 
four simulations, and the thin lines show the results from each separate simulation to demonstrate 
the statistical error in the curves. 

Figure 5(a) shows good agreement between the 40 and 80 h~ x Mpc boxes for < S m \dF >, except 
at large 5f, where the trend toward increasing mean mass with increasing box size from 20 to 40 to 
80 h _1 Mpc is probably not a result of statistical fluctuations. In fact, the true difference between 
the 40 and 80 h^ 1 Mpc boxes is even larger than what is shown because the decreased resolution in 
the 80 h -1 Mpc box has the effect of suppressing <5 m \5F> at large 5f (see Figure 4a). It is clear 
that much of the flattening of <d m |<5p> that we see at large 5f is an effect of the finite box size. 
Fortunately, the other statistics shown in Figure 5(b-d) generally exhibit better convergence than 
< | $F > • 

The results in Figures 5(a-d) should also serve as a cautionary reminder that statistical fluctua¬ 
tions from one simulation to another can be large, and must be taken into account when comparing 
to observations or to other simulations, especially single, relatively small simulations. Surprisingly, 
the scatter between the separate simulations in the high 5f corner of the plot is not reduced by 
increasing the box size from 40 to 80 /r _1 Mpc. This may be an indication that the statistical 
fluctuations in the result are still dominated by the longest wavelength modes in the box. 

For reference, for our standard model, in a 40/? -1 Mpc box with our standard smoothing, the 
mis fluctuations in Sf and S m are op = 0.16 and a m = 0.30. 

3. PARAMETER DEPENDENCE OF THE LYa FOREST - MASS 

CORRELATION 

We now examine the dependence of the four functions < 5 m \5p >, < Sf\ 5 m >, < 5 m \5F >, 
and < 5f\ d m . > on the most important parameters determining the properties of the Lya forest. 
These are the mean transmitted flux, the mean temperature-density relation of the intergalactic gas 
(parameterized as T = ToA 7-1 ), and the amplitude and power-law slope of the power spectrum. 
These parameters fully determine all the physical effects that are incorporated in an HPM simu¬ 
lation and the calculation of the Lya spectra: the underlying dark matter fluctuations (assumed 
to be Gaussian initially), the smoothing of the gas distribution on the Jeans scale, and the neutral 
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Fig. 5.— Box size and statistical error test. The thin curves are from four independent simulations, 
the thick curves are the average of the four. Black (solid) lines: 40 /i -1 Mpc simulations; red (dotted) 
lines: 20 /r _1 Mpc simulations; green (dashed) lines: 80 h~ 1 Mpc simulations. All with 512 3 particles 
(the 80 h^ 1 Mpc simulations are slightly under-resolved). 
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fraction of the gas and thermal broadening. 


3.1. Mean transmitted flux 

The value of the mean transmitted flux used in our standard model, F = 0.67, is appropriate 
for z = 3. The variation of the mean transmitted flux is the most important factor accounting for 
the changes in the Lya forest with redshift. 

Figures 6 (a-d) show the four functions defined previously for F = 0.67 (black, solid line), 
F = 0.8, appropriate for z ~ 2.5 (green, long-dashed line), F = 0.6, appropriate for z — 3.5 (blue, 
short-dashed line), and F = 0.7 (red, dotted line). 

The functions < 5 m \8p> and <5i?|(5 m > depend strongly on F. whereas the functions with the 
normalized quantities < S m \Sf > and < t>V<5 m > have a much weaker dependence. Essentially, the 
mean transmitted flux changes the effective “bias” in the Lya forest that determines the value of Sf 
that corresponds to a given mass fluctuation, and once a linear bias is eliminated the dependence 
on F is much smaller. Note that Figures 6 a,b reflect the fact that the Lya forest bias is larger for 
higher mean flux decrements (see Figure 10c of McDonald 2002, and Croft et al. 1999, McDonald 
et al. 2000 ). 

We give the results for F = 0.6 and F = 0.8 in Table 1, as rows 6 sd and 61d, respectively. The 
first row in the Table already contains the results for F = 0.67. 


3.2. Temperature-density relation 

The temperature-density relation has practically no effect on the functions we are analyzing, 
as shown in Figures 7(a-d). The reason for this insensitivity is that the temperature-density 
relation does not affect the large-scale properties of the Lya forest directly, and apparently its effect 
through the large-scale bias factor is quite small (see Figure 10b in McDonald 2002). 


3.3. Amplitude of the power spectrum 

In Figure 8, we show the variations with the amplitude of the mass power spectrum by in¬ 
creasing the amplitude in our standard model, shown by the solid line, by 33%, to produce the 
dotted line. In practice, we do this by simply using an output of the same simulation at a scale 
factor 1.33 times larger, but keeping the mean transmitted flux in the Lya spectrum constant (this 
is completely equivalent to running a new simulation with a different amplitude, except that the 
effective temperature changes due to the change in the relation between comoving distance and 
velocity, but the previous figures show that the temperature does not affect the results). The 
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Fig. 6.— Effect of mean transmitted flux fraction. Shown are F = 0.67 (black, solid line), F = 0.8, 
(green, long-dashed line), F = 0.6, (blue, short-dashed line), and F = 0.7 (red, dotted line), for 
the same four functions as previously. 
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Fig. 7.— Effect of temperature-density relation. Black, solid line: T 1.4 = 17000 K, 7 — 1 = 0.3 
(standard values); red, dotted line: T 1.4 = 22000 K, 7 — 1 = 0.3; green, dashed line: T 1.4 = 17000 
K, 7 — 1 = 0.6. 
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Fig. 8.— Effect of the mass power spectrum. Black, solid line: Standard power spectrum. Red, 
dotted line: rms amplitude increased by 33%. Green, dashed line: n = 0.85 instead of n = 0.95, 
with fixed amplitude at k = 1 ( h _1 Mpc)^ 1 . 
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result reflects that the bias factor of the Lya forest decreases with increasing power spectrum am¬ 
plitude (see Figure 10a of McDonald 2002). The difference is partially cancelled when using the 
normalized variables Sf and S m , although less so for the function < 5p \ 8 rn >. 

We also vary the power-law slope of the power spectrum in Figure 8, by changing n from 0.95 
to 0.85 while holding the power fixed at wavenumber k = 1 h Mpc. The effect we see is relatively 
small. Had we carefully chosen the value of k at which we hold the power fixed to minimize the 
variation with n, the effect might be even smaller. 

We give the results for as = 1 and n = 0.85 in Table 1, as rows 8d and 81d, respectively. The 
first row in the Table is for the standard power spectrum. 


4. DISCUSSION 

We have presented predictions for the expected correlation of the mass and the Lya forest 
transmitted flux, smoothed over a cube size of ~ 10/i _1 Mpc. Our most basic result is that the 
relation between < <5 m |<5ir > and Sp is linear over most of the range of 5f (excluding rare, high 
Sp values), with a slope of 0.6 on this smoothing scale. We have shown that this correlation is 
not sensitive to the temperature-density relation of the gas. There is a dependence on the mean 
transmitted flux and the amplitude of the mass power spectrum, but these quantities are already 
measured to reasonable accuracy (Croft et al. 1999, McDonald et al. 2000). We have also shown 
that the predictions do not suffer from uncertainties due to the resolution of the simulations. 
Calculations with even larger boxes than used here will be desirable, because the uncertainties in 
the predictions due to the variance in the simulations and the suppression of the large-scale power 
may still be significant; however, this uncertainty is mostly isolated in the most rare, high density 
regions. 

A similar relation holds between <Sf |h m > and S m . We notice, however, that the observational 
determination of this other function will be affected by galaxy shot noise. The number of galaxies 
observed in a certain cube in redshift space containing a fixed mass is subject to shot noise, and this 
inevitably introduces a smoothing of the function <5p\5 m >, which must be taken into account 
before any comparisons to our theoretical results are made (in contrast, galaxy shot noise does 
not change the average < dp\S m >, but it does alter the mis dispersion op needed to compute 
5f). Measuring the effects of galaxy shot noise can also teach us useful information about how 
galaxies form, because galaxy shot noise does not generally need to be strictly described by Poisson 
statistics. For example, for a fixed mass contained within a cube, once a galaxy is found in the 
cube the probability to find others may be lower because some mass has already been used up by 
that galaxy. 

Comparing these theoretical predictions with observations allows for several tests of the basic 
theory of the Lya forest, and can reveal new information on the spatial distribution of the galaxies. 
The functions <5 m \5F > and <8f\S m >, which should not be affected by any linear galaxy bias, 
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provide a powerful test of the assumption that the basic framework assumed here for the nature of 
the Lya forest is correct, and that galaxies trace the mass on large scales apart from linear bias. 
If the slope of < 5 m \5F > as a function of 5p were found to be larger than predicted, this would 
imply that the Lya forest is much more closely associated with galaxies than is expected from their 
common correlation with the mass distribution. If the observed slope were smaller than predicted, 
it would indicate that the Lya absorbing gas and the galaxies tend to avoid each other for some 
reason. 

If the correlation of 5 m and dp is as expected, this will imply a strong confirmation of the basic 
model we have for the Lya forest, and will justify using the Lya forest as a predictor of the mass 
fluctuations. Comparing the predicted function <5 m \5F > with the observed < 6 g \5F > will then 
yield the bias factor of any type of galaxies for which these observations can be made. 

The correlation of galaxies and the Lya forest can be measured as a function of scale. Our 
predicted value of 0.6 for the slope of < 5 m \5F > as a function of dp should decrease with the 
smoothing scale, in a way that reflects the shape of the mass autocorrelation function. This will 
allow a precise test of the idea that the large-scale distributions of different types of galaxies differ 
only in a constant linear bias relative to the mass fluctuations. 


We acknowledge useful discussions with Kurt Adelberger, Charles Steidel, and David Weinberg. 
We thank Nick Gnedin for the HPM code. PM is supported by a grant from the Packard Foundation. 
RC is supported by grants AST93-18185 and ASC97-40300. JM is supported by grant NSF-0098515. 
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Table 1. Selected Results 


Fig., 

line 

OF 

Om 

< 8 m \SF > 

5 f = -1.5 

0 

1.5 

3 

< > 

8 m = “I 

0 

1 

2 

2 , s a 

0.16 

0.30 

-0.97 

0.07 

0.80 

1.17 

-0.61 

0.09 

0.58 

1.00 

2 , Id b 

0.15 

0.26 

-0.95 

0.09 

0.73 

1.03 

-0.56 

0.09 

0.53 

0.94 

2 , sd c 

0.18 

0.36 

-1.00 

0.05 

0.87 

1.35 

-0.68 

0.12 

0.63 

1.05 

6 , Id d 

0.11 

0.30 

-1.09 

0.10 

0.86 

1.26 

-0.63 

0.06 

0.60 

1.11 

6 , sd e 

0.19 

0.30 

-0.92 

0.06 

0.77 

1.12 

-0.60 

0.11 

0.56 

0.94 

GO 

0.18 

0.39 

-0.98 

0.06 

0.84 

1.27 

-0.65 

0.12 

0.60 

1.03 

8, Id s 

0.17 

0.33 

-1.00 

0.07 

0.84 

1.21 

-0.64 

0.11 

0.60 

1.03 


Note. — First column gives the Figure and curve to which the row of results applies. Solid line=s, 
dotted=d, long-dashed=ld, and short-dashed=sd. Value for < 8 m \5p = 3 > underestimated due to 
limited box size. a m and ap intended only for use as conversion factors from 5 to <5 - alone they 
may be sensitive to resolution and box size. 

a Standard case. 

b 12 h~ l Mpc cube, 4.2 h~ x Mpc Gaussian. 
c 8hr 1 Mpc cube, 2.7 hr 1 Mpc Gaussian. 
d F = 0.8. 
e F = 0.6. 
f o- 8 = 1 


% = 0.85. 







